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1 Rav Sequence Listing 

Patent implication US/07/659,408 



SEQTre»fCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: CAPUT, DANIEL 

FERRARA, PASCUAL 
GUILLEMOT, JEAN-CLAUDE 
KAGHAO, MOXJRAD 
LEGOUX, RICHARD 
LOISON, GERARD 
LARBRE, ELIZABETH 
LUPKER, JOHANNES 
LEPLATOIS, PASCUAL 
SALOME, MARK 



(11) TITLE OF INVENTION: URATE OXIDASE ACTIVITY PROTEIN, 

RECOMBINANT GENE CODING THEREFOR, EXPRESSION VECTOR, 
MICRO-ORGANISMS AND TRANSFORMED CELLS 



(111) NimBER OF SEQUENCES: 35 



(Iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Foley & Lardner 

(B) STREET: 1800 Diagonal Road, Suite 500 

(C) CITY: Alexandria 

(D) STATE: Virginia 

(E) COUNTRY: USA 

(F) ZIP: 22313-0299 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 



(vl) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/659,408 

(B) FILING DATE: 25-APR-1991 

(C) CLASSIFICATION: 



(vlll) ATTORNEY/AGENT INFORMATION: 

(A) NAME: BENT, Stephen A. 

(B) REGISTRATION NUMBER: 29,768 

(C) REFERENCE/DOCKET NUMBER: 16781/276 BEDL 



(Ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703)836-9300 

(B) TELEFAX: (703)683-4109 

(C) TELEX: 899149 



(2) INFORMATION FOR SEQ ID NO:l: 
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54 

55 (1) SEQUENCE CHARACTERISTICS: 



56 (A) LENGTH: 301 amino acids 

57 (B) TYPE: amino acid 

58 (D) TOPOLOGY: linear 
59 



60 (ii) MOLECULE TYPE: protein 

61 

62 (iii) HYPOTHETICAL: NO 

63 

64 (vi) ORIGINAL SOURCE: 

65 (A) ORGANISM: Aspergillus flavus 
66 

67 (vii) IMMEDIATE SOURCE: 

68 (B) CLONE: Urate oxidase 
69 

70 

71 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

72 

73 Ser Ala Val Lys Ala Ala Arg Tyr Gly Lys Asp Asn Val Arg Val Tyr 

74 1 5 10 15 
75 

76 Lys Val His Lys Asp Glu Lys Thr Gly Val Gin Thr Val Tyr Glu Met 

77 20 25 30 
78 

79 Thr Val Cys Val Leu Leu Glu Gly Glu lie Glu Thr Ser Tyr Thr Lys 

80 35 40 45 
81 

82 Ala Asp Asn Ser Val lie Val Ala Thr Asp Ser lie Lys Asn Thr lie 

83 50 55 60 
84 

85 Tyr lie Thr Ala Lys Gin Asn Pro Val Thr Pro Pro Glu Leu Phe Gly 

86 65 70 75 80 
87 

88 Ser lie Leu Gly Thr His Phe lie Glu Lys Tyr Asn His lie His Ala 

89 85 90 95 
90 

91 Ala His Val Asn lie Val Cys His Arg Trp Thr Arg Met Asp lie Asp 

92 100 105 110 
93 

94 Gly Lys Pro His Pro His Ser Phe lie Arg Asp Ser Glu Glu Lys Arg 

95 115 120 125 
96 

97 Asn Val Gin Val Asp Val Val Glu Gly Lys Gly lie Asp He Lys Ser 

98 130 135 140 
99 

100 Ser Leu Ser Gly Leu Thr Val Leu Lys Ser Thr Asn Ser Gin Phe Trp 

101 145 150 155 160 
102 

103 Gly Phe Leu Arg Asp Glu Tyr Thr Thr Leu Lys Glu Thr Trp Asp Arg 

104 165 170 175 
105 

106 He Leu Ser Thr Asp Val Asp Ala Thr Trp Gin Trp Lys Asn Phe Ser 
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107 






180 








185 




190 




108 






















t no 
109 


Oly 


Leu Gin 


Glu 


Val Arg 


Ser 


His 


Val 


Pro Lys 


Phe Asp Ala Thr Trp 


110 




19S 








200 






205 




lit 
111 






















1 1 o 

1 Iz 


Kid 

Ala 


xniT Ala 


Arg 


Glu Val 


Thr Leu Lys 


Thr Phe 


Ala Glu Asp Asn 


Ser 


113 




210 






215 








220 




114 






















tic 


Ala 


Ser Val 


Gin 


Ala Thr 


Met 


Tyr 


Lys 


Met 'Ala Glu Gin He Leu 


Ala 


116 


225 






230 








235 




240 


117 






















llo 




uin Gin 


Leu 


He Glu 


Thr 


Val 


Glu 


Tyr Ser 


Leu Pro Asn Lys 


His 


119 








245 








250 


255 




120 






















121 


Tyr 


Phe Glu 


He 


Asp Leu 


Ser 


Trp 


His 


Lys Gly Leu Gin Asn Thr Gly 


Izz 






260 








265 




270 




123 






















124 


Lys 


Asn Ala 


Glu 


Val Phe 


Ala 


Pro 


Gin 


Ser Asp 


Pro Asn Gly Leu 


He 


IOC 

125 




275 








280 






285 




126 






















127 


Lys 


Cys Thr 


Val 


Gly Arg 


Ser 


Ser 


Leu 


Lys Ser 


Lys Leu 




128 




290 






295 








300 




129 


(2) INFORMATION FOR SEQ ID NO: 2: 












130 






















131 


(i) 


SEQUENCE CHARACTERISTICS: 










132 




(A) LENGTH; 


1 302 amino acids 








1 

133 




(B) TYPE: amino acid 












134 




(C) STRANDEONESS: single 










133 




(D) TOPOLOGY: linear 












13d 






















137 


(ii) 


MOLECULE TYPE: protein 












too 
13o 






















137 


(iii) 


HYPOTHETICAL: NO 














1 >i n 
140 






















1 A1 
141 


(vi) 


ORIGINAL SOURCE: 














142 




(A) ORGANISM: Aspergillus flavus 






143 






















144 


(vii) 


IMMEDIATE SOURCE: 














145 




(B) CLONE: 


Met-Urate oxidase 








146 






















1 A"T 

14 / 






















148 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 2: 






149 






















150 


Met 


Ser Ala 


Val 


Lys Ala 


Ala Arg Tyr Gly Lys Asp Asn Val Arg Val 


151 


1 






5 








10 


15 




152 






















153 


Tyr 


Lys Val 


His 


Lys Asp Glu Lys 


Thr Gly Val 


Gin Thr Val Tyr 


Glu 


154 






20 








25 




30 




155 






















156 


Met 


Thr Val 


Cys 


Val Leu 


Leu 


Glu 


Gly Glu He 


Glu Thr Ser Tyr 


Thr 


157 




35 








40 






45 




158 






















159 


Lys 


Ala Asp Asn 


Ser Val 


He 


Val 


Ala 


Thr Asp 


Ser He Lys Asn 


Thr 
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160 50 55 60 

161 

162 He Tyr He Thr Ala Lys Oln Asn Pro Val Thr Pro Pro Glu Leu Phe 

163 65 70 75 80 
164 

165 Oly Ser He Leu Gly Thr His Phe He Glu Lys Tyr Asn His He His 

166 85 90 95 
167 

168 Ala Ala His Val Asn He Val Cys His Arg Trp Thr Arg Met Asp He 

169 100 105 110 
170 

171 Asp Oly Lys Pro His Pro His Ser Phe He Arg Asp Ser Glu Olu Lys 

172 115 120 125 
173 

174 Arg Asn Val Oln Val Asp Val Val Glu Gly Lys Gly He Asp He Lys 

175 130 135 140 
176 

177 Ser Ser Leu Ser Gly Leu Thr Val Leu Lys Ser Thr Asn Ser Oln Phe 

178 145 150 155 160 
179 

180 Trp Gly Phe Leu Arg Asp Glu Tyr Thr Thr Leu Lys Glu Thr Trp Asp 

181 165 170 175 
182 

183 Arg He Leu Ser Thr Asp Val Asp Ala Thr Trp Gin Trp Lys Asn Phe 

184 180 185 190 
185 

186 Ser Oly Leu Oln Glu Val Arg Ser His Val Pro Lys Phe Asp Ala Thr 

187 195 200 205 
188 

189 Trp Ala Thr Ala Arg Glu Val Thr Leu Lys Thr Phe Ala Glu Asp Asn 

190 210 215 220 
191 

192 Ser Ala Ser Val Gin Ala Thr Met Tyr Lys Met Ala Glu Gin He Leu 

193 225 230 235 240 
194 

195 Ala Arg Gin Gin Leu He Glu Thr Val Glu Tyr Ser Leu Pro Asn Lys 

196 245 250 255 
197 

198 His Tyr Phe Glu He Asp Leu Ser Trp His Lys Gly Leu Gin Asn Thr 

199 260 265 270 
200 

201 Gly Lys Asn Ala Glu Val Phe Ala Pro Gin Ser Asp Pro Asn Gly Leu 

202 275 280 285 
203 

204 He Lys Cys Thr Val Gly Arg Ser Ser Leu Lys Ser Lys Leu 

205 290 295 300 
206 

207 (2) INFORMATION FOR SEQ ID NO: 3: 
208 

209 (i) SEQUENCE CHARACTERISTICS: 

210 (A) LENGTH: 906 base pairs 

211 (B) TYPE: nucleic acid 

212 (C) STRANDEDNESS: single 
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213 (D) TOPOLOGY: linear 

214 

215 (ii) MOLECULE TYPE: DNA (genomic) 

216 

217 (vii) IMMEDIATE SOURCE: 

218 (B) CLONE: Preferred sequence for expression in 

219 prokaryotes 
220 

221 

222 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

223 

224 AT0TCT0C60 TAAAA6CAGC 6CGCTACGGC AAGGACAATO TTCGC6TCTA CAAOGTTCAC 60 
225 

226 AAGGACGAGA AOACCG6TGT CCAGACG6TG TACGAGATGA CCGTCTGTGT GCTTCTG6A6 120 
227 

228 GGTGAOATTG AOACCTCTTA CACCAAGGCC GACAACAGCG TCATTGTCGC AACCGACTCC 180 
229 

230 ATTAAGAACA CCATTTACAT CACCGCCAAG CAGAACCCCG TTACTCCTCC CGAGCTGTTC 240 
231 

232 GGCTCCATCC TGGGCACACA CTTCATTGAG AAGTACAACC ACATCCATGC CGCTCACGTC 300 
233 

234 AACATTGTCT GCCACCGCTG GACCCGGATG GACATTGACG GCAAGCCACA CCCTCACTCC 360 
235 

236 TTCATCCGCG ACAGCGAGGA OAAGCGGAAT GTGCAGGTGG ACGTGGTCOA GGGCAAGGGC 420 
237 

238 ATC6ATATCA AGTCGTCTCT GTCCGGCCTG ACCGTGCTGA AGAGCACCAA CTCGCAGTTC 480 
239 

240 IGGGGCTTCC TGCGTGACGA GTACACCACA CTTAAGGA6A CCTGGGACCG TATCCTGAGC 540 
241 

242 ACCGAC6TCG ATGCCACTTG GCAGIGGAAG AATTTCAGTG GACTCCAGGA GGTCCGCTCG 600 
243 

244 CACGTGCCTA AGTTCGATGC TACCTGGGCC ACTGCTCGCG AGGTCACTCT GAAOACTTTT 660 
245 

246 GCTGAAGATA ACAGTGCCAG C6T6CAGGCC ACTATGTACA AGATGGCAGA GCAAATCCTG 720 
247 

248 GCGCGCCAGC AGCTGATCGA GACTGTCGAO TACTCGTTOC CTAACAAGCA CTATTTCGAA 780 
249 

250 ATCGACCTGA GCTGGCACAA GGGCCTCCAA AACACCGGCA AGAACGCCGA GGTCTTCGCT 840 
251 

252 CCTCAGTCGG ACCCCAACGG TCT6ATCAAG TGIACCGTCG GCCGOTCCTC TCTGAAGTCT 900 
253 

254 AAATTG 906 
255 

256 (2) INFORMATION FOR SEQ ID NO: 4: 

257 (i) SEQUENCE CHARACTERISTICS: 

258 (A) LENGTH: 906 base pairs 

259 (B) TYPE: nucleic acid 

260 (C) STRANDEDNESS: single 

261 (D) TOPOLOGY: linear 
262 

263 (ii) MOLECULE TYPE: DNA (genomic) 

264 
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266 
267 
268 
269 
270 



(vii) IMMEDIATE SOXmCE: 

(B) CLONE: Preferred sequence for expression in 
eukaryotes 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



271 

272 AT6TCTOCT6 TTAAGGCTGC TAOATACGGT AA06ACAACO TTAOAOTCTA CAAGGTTCAC 
273 

274 AAGOACOAOA AGACCGGT6T CCAOACGOT6 IAC6A0AT6A CC6TCTGTGI GCTTCTOGAG 
275 

276 GGTOAGATTG AGACCTCTTA CACCAAOGCC GACAACAGCG TCATT6TCGC AACCGACTCC 
277 

278 ATTAAGAACA CCATTTACAT CACCGCCAAG CA6AACCCCO TTACTCCTCC CGAGCT6TTC 
279 

280 6GCTCCATCC TGGGCACACA CTTCATTGAG AA6TACAACC ACATCCATOC COCTCACGTC 
281 

282 AACATTGTCT 6CCACCGCTG GACCCGGAT6 GACATTGACG GCAAGCCACA CCCTCACTCC 
283 

284 TTCATCCGCG ACAGCGAGGA 6AA6CGGAAT GIGCA06TGG ACGTGGTCGA OGGCAAGGGC 
285 

286 ATCGATATCA AGTCGTCTCT GTCCGOCCTO ACCGTGCTGA AGAGCACCAA CXCGCAGTTC 
287 

288 TGGG6CTTCC TGCGXGACGA GTACACCACA CTTAAGGA6A CCTG6GACCG TATCCTGAGC 
289 

290 ACCOACGTCG AT6CCACTTG GCAGTGGAAG AATTTCAGTG GACTCCAOGA G6TCCGCTCG 
291 

292 CACGTGCCTA AGTTCGATOC TACCTGG6CC ACTGCICGCG AOGTCACTCT GAAGACTTTT 
293 

294 GCTGAA6ATA ACAGTGCCAG CGTGCA6GCC ACTATGTACA AGAT6GCAGA GCAAATCCTG 
295 

296 GC6C6CCAGC AOCTGATCGA GACTGTCGAO TACTCGTTGC CTAACAAGCA CTATTTC6AA 
297 

298 AICGACCTGA GCTGGCACAA GG6CCTCCAA AACACCGGCA AGAACOCCGA GGTCTTCGCT 
299 

300 CCTCAGICOG ACCCCAACOG TCTGATCAAG IGTACCGTCG GCCGGICCTC TCTGAAGXCT 
301 

302 AAAITG 
303 

304 (2) INFORMATION FOR SEQ ID NO: 5: 
305 

306 (i) SEQUENCE CHARACTERISTICS: 

307 (A) LENGTH: 14 base pairs 

308 (B) TYPE: nucleic acid 

309 (C) STRANDEDNESS: single 

310 (D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
906 



311 
312 
313 
314 
315 
316 
317 
318 



(iii) 



(vii) 



(ii) 



MOLECULE TYPE: DNA (genomic) 



HYPOTHETICAL: NO 



IMMEDIATE SOURCE: 
(B) CLONE: Preferred non-translated 5' sequence for 
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319 animal cells 

320 

321 (xi) SEQtnSNCE DESCRIPTION: SEQ ID NO: 5: 

322 A6CTTOCCOC CACT 14 
323 

324 (2) INFORMATION FOR SEQ ID NO: 6: 
325 

326 (1) SEQUENCE CHARACTERISTICS: 

327 (A) LENGTH: 906 base pairs 

328 (B) TYPE: nucleic acid 

329 (C) STRANDEDNESS : single 

330 (D) TOPOLOGY: linear 
331 

332 (ii) MOLECULE TYPE: DNA (genomic) 

333 

334 (iii) HYPOTHETICAL: NO 

335 

336 

337 (vii) IMMEDIATE SOURCE: 

338 (B) CLONE: Preferred sequence for expression in animal 

339 cells 
340 

341 

342 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

343 

344 ATGTCC6CAG TAAAAGCAGC CCGCTACGGC AA06ACAATG TCCGCGTCTA CAAOGTTCAC 60 
345 

346 AAGGACGAGA AGACC6GTGT CCAGAC6GTG TACGAOATOA CCGTCTGTGT 6CTTCTGGAG 120 
347 

348 GGTGAGATTG AGACCTCTTA CACCAA60CC GACAACAGCG TCATTGTCGC AACCGACTCC 180 
349 

350 ATTAAGAACA CCATTTACAT CACCGCCAAG CAGAACCCCG TTACTCCTCC CGAGCTOTTC 240 
351 

352 GGCTCCATCC TGGGCACACA CTTCATTGAG AAGTACAACC ACATCCATGC CGCTCACGTC 300 
353 

354 AACATTGTCT GCCACCGCTG GACCCGGATG GACATTGACO GCAAOCCACA CCCTCACTCC 360 
355 

356 TTCATCCGCG ACAGCGAGGA GAAGCGGAAT GTGCAGGT60 ACGTGGTCGA 6GGCAAGGGC 420 
357 

358 ATCGATATCA AGTCGTCTCT GTCCGGCCTG ACCGT6CTGA AGAGCACCAA CTCGCAGTTC 480 
359 

360 TGGGGCTTCC TGCGTGACGA GTACACCACA CTTAA6GAGA CCT6GGACC6 TATCCTGAGC 540 
361 

362 ACCGACGTCG ATGCCACTTG GCAGTGGAAG AATTTCAGTG GACTCCAGGA GGTCCGCTCG 600 
363 

364 CACGTGCCTA AGTTCGATGC TACCTGGGCC ACTGCTCGCG AGGTCACTCT GAAGACTTTT 660 
365 

366 GCTGAAGATA ACAGT6CCAG CGTGCAGGCC ACTATGTACA AGATGGCAGA GCAAATCCTG 720 
367 

368 GCGCGCCAGC AGCTGATCGA GACTGTCGAG TACTCGTTGC CTAACAAGCA CTATTTCGAA 780 
369 

370 ATCGACCTGA GCTGGCACAA GGGCCTCCAA AACACCGGCA AGAACGCCGA GGTCTTCGCT 840 
371 
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372 
373 
374 
375 
376 
377 
378 
379 
380 
381 
382 
383 
384 
385 
386 
387 
388 
389 
390 
391 
392 
393 
394 
395 
396 
397 
398 
399 
400 
401 
402 
403 
404 
405 
406 
407 
408 
409 
410 
411 
412 
413 
414 
415 
416 
417 
418 
419 
420 
421 
422 
423 
424 
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CCTCAGICGG ACCCCAACOO TCTOATCAAO I6IACC6ICO OCCGGICCTC ICTOAA6ICI 900 
AAATTO 906 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: reverse transcription primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GATCC6GGCC CTTTTTTTTT TTT 23 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hydrolysis product T 17 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Asn Val Gin Val Asp Val Val Glu Gly Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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425 

426 (ii) MOLECXILE TYPE: peptide 

427 

428 (lii) HYPOTHETICAL: NO 

429 

430 (vii) IMMEDIATE SOURCE: 

431 (B) CLONE: Hydrolysis product T 20 
432 

433 

434 (xl) SEQTJENCE DESCRIPTION: SEQ ID NO: 9: 

435 

436 Asn Phe Ser Gly Leu 61n Olu Val 

437 1 5 
438 

439 (2) INFORMATION FOR SEQ ID NO: 10: 
440 

441 (i) SEQUENCE CHARACTERISTICS: 

442 (A) LENGTH: 6 amino acids 

443 (B) TYPE: amino acid 

444 (C) STRANDEDNESS : single 

445 (D) TOPOLOGY: linear 
446 

447 (11) MOLECULE TYPE: peptide 

448 

449 (iii) HYPOTHETICAL: NO 

450 

451 

452 (vii) IMMEDIATE SOURCE: 

453 (B) CLONE: Hydrolysis product T 23 
454 

455 

456 (xl) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

457 

458 Phe Asp Ala Thr Trp Ala 

459 1 5 
460 

461 (2) INFORMATION FOR SEQ ID NO: 11: 
462 

463 (1) SEQUENCE CHARACTERISTICS: 

464 (A) LENGTH: 8 amino acids 

465 (B) TYPE: amino acid 

466 (C) STRANDEDNESS: single 

467 (O) TOPOLOGY: linear 
468 

469 (11) MOLECULE TYPE: peptide 

470 

471 (iii) HYPOTHETICAL: NO 

472 

473 

474 (vii) IMMEDIATE SOURCE: 

475 (B) CLONE: Hydrolysis product T 27 
476 

477 
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V f o 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 11: 


IB f y 








His 


Tyr Phe Olu lie Asp Leu Ser 


vox 


1 


5 


482 








(2) INFORMATION FOR SEQ ID NO: 12: 


AAA 








I*/ 




AAA 




\nf jjciMvxit* ainxiio acxus 


AA7 




/n\ TV PR • Aminn ar^4H 






\^) a±i\AnuBiuntiooi sxn^xe 


ADO 






Aon 






A4 1 


(ii) 


MOLECULE TYPE: peptide 








V 7 w 


(iii) 


HYPOTHETICAL: NO 


AO A 






47 3 






AQ£ 
v70 


(vii) 


IMMEDIATE SOURCE: 


AQ7 
47 / 




(B) CLONE: Hydrolysis product T 28 


AQfi 

V 7 O 






AQQ 

V 7 7 








(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 12: 


SO 1 
•J li X 








lie 


Leu Ser Thr Asp Val Asp Ala Thr Trp 




1 


5 10 


ROA 








(2) INFORMATION FOR SEQ ID NO: 13: 








311 / 


(i) 


SEQUENCE CHARACTERISTICS: 


3Uo 




(A) LENGTH: 11 amino acids 


509 




(B) TYPE: amino acid 


510 




(C) STRANDEDNESS: single 


R 1 1 

3 X X 




(D) TOPOLOGY: linear 


*? 1 9 

9 X A 






3 X w 


(ii) 


NOLECXILE TYPE: peptide 


S 1 A 
3X4 






515 


(iii) 


HYPOTHETICAL: NO 


sift 

3 XO 






3 X / 






3 XO 


(vii) 


IMMEDIATE SOURCE: 


3X7 




(B) CLONE: Hydrolysis product T 29 








521 






522 


(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 13: 


523 






524 


His 


Tyr Phe Glu lie Asp Leu Ser Trp His : 


525 


1 


5 10 


526 






527 


(2) INFORMATION FOR SEQ ID NO: 14: 


528 






529 


(i) 


SEQUENCE CHARACTERISTICS: 


530 




(A) LENGTH: 11 amino acids 
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531 (B) TYPE: amino acid 

532 (C) STRANDEDNESS: single 

533 (D) TOPOLOGY: linear 
534 

535 (ii) MOLECULE TYPE: peptide 

536 

537 (iii) HYPOTHETICAL: NO 

538 

539 

540 (vii) IMMEDIATE SOURCE: 

541 (B) CLONE: Hydrolysis product T 31 
542 

543 

544 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

545 

546 Ser Thr Asn Ser Gin Phe Trp Oly Phe Leu Arg 

547 1 5 10 
548 

549 (2) INFORMATION FOR SEQ ID NO: 15: 
550 

551 (i) SEQUENCE CHARACTERISTICS: 

552 (A) LENGTH: 16 amino acids 

553 (B) TYPE: amino acid 

554 (C) STRANDEDNESS: single 

555 (D) TOPOLOGY: linear 
556 

557 (ii) MOLECULE TYPE: peptide 

558 

559 (iii) HYPOTHETICAL: NO 

560 

561 

562 (vii) IMMEDIATE SOURCE: 

563 (B) CLONE: Hydrolysis product T 32 
564 

565 

566 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

567 

568 Gin Asn Pro Val Thr Pro Pro Glu Leu Phe Gly Ser lie Leu Gly Thr 

569 15 10 15 
570 

571 

572 (2) INFORMATION FOR SEQ ID NO: 16: 
573 

574 (i) SEQUENCE CHARACTERISTICS: 

575 (A) LENGTH: 16 amino acids 

576 (B) TYPE: amino acid 

577 (C) STRANDEDNESS: single 

578 (D) TOPOLOGY: linear 
579 

580 (ii) MOLECULE TYPE: peptide 

581 

582 (iii) HYPOTHETICAL: NO 

583 
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584 

585 (vii) IMMEDIATE SOURCE: 

586 (B) CLONE: Hydrolysis product I 33 
587 

588 

589 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

590 

591 Gin Asn Pro Val Thr Pro Pro Olu Leu Phe Gly Ser He Leu Gly Thr 

592 1 5 10 15 
593 

594 

595 (2) INFORMATION FOR SEQ ID NO: 17: 
596 

597 (i) SEQUENCE CHARACTERISTICS: 

598 (A) LENGTH: 25 amino acids 

599 (B) TYPE: amino acid 

600 (C) STRANDEDNESS: single 

601 (D) TOPOLOGY: linear 
602 

603 (ii) MOLECULE TYPE: peptide 

604 

605 (iii) HYPOTHETICAL: NO 

606 

607 

608 (vii) IMMEDIATE SOURCE: 

609 (B) CLONE: Hydrolysis product V 1 
610 

611 

612 (xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 17: 

613 

614 Tyr Ser Leu Pro Asn Lys Ala Asp Asn Ser Val He Val Asp Thr Asp 

615 15 10 15 
616 

617 Ser He Lys Asn Thr He Tyr He Thr 

618 20 25 
619 

620 (2) INFORMATION FOR SEQ ID NO: 18: 
621 

622 (i) SEQUENCE CHARACTERISTICS: 

623 (A) LENGTH: 16 amino acids 

624 (B) TYPE: amino acid 

625 (C) STRANDEDNESS: single 

626 (D) TOPOLOGY: linear 
627 

628 (ii) MOLECULE TYPE: peptide 

629 

630 (iii) HYPOTHETICAL: NO 

631 

632 

633 (vii) IMMEDIATE SOURCE: 

634 (B) CLONE: Hydrolysis product V 2 
635 

636 
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637 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

638 

639 Val Thr Leu Lys Thr Phe Ala Olu Asp Asn Ser Ala Ser Val Oln Ala 

640 15 10 15 
641 

642 (2) INFORMATION FOR SEQ ID NO: 19: 
643 

644 (i) SEQUENCE CHARACTERISTICS: 

645 (A) LENGTH: 24 amino acids 

646 (B) TYPE: amino acid 

647 (C) STRANDEDNESS: single 

648 (D) TOPOLOOY: linear 
649 

650 (ii) MOLECULE TYPE: peptide 

651 

652 (iil) HYPOTHETICAL: NO 

653 

654 

655 (vii) IMMEDIATE SOURCE: 

656 (B) CLONE: Hydrolysis product V 3 
657 

658 

659 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

660 

661 Thr Ser Tyr Thr Lys Ala Asp Asn Ser Val lie Val Asp Thr Asp Ser 

662 15 10 15 
663 

664 lie Lys Asn Thr lie Tyr lie Thr 

665 20 
666 

667 (2) INFORMATION FOR SEQ ID N0:20: 
668 

669 (i) SEQUENCE CHARACTERISTICS: 

670 (A) LENGTH: 28 amino acids 

671 (B) TYPE: amino acid 

672 (C) STRANDEDNESS: single 

673 (D) TOPOLOOY: linear 
674 

675 (ii) MOLECULE TYPE: peptide 

676 

677 (iii) HYPOTHETICAL: NO 

678 

679 

680 (vii) IMMEDIATE SOURCE: 

681 (B) CLONE: Hydrolysis product V 5 
682 

683 

684 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

685 

686 Gly Lys Oly lie Asp lie Lys Ser Ser Leu Ser Gly Leu Thr Val Leu 

687 15 10 15 
688 

689 Lys Ser Thr Asn Ser Gin Phe Trp Gly Phe Leu Arg 
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690 20 25 

691 

692 (2) INFORMATION FOR SEQ ID NO: 21: 
693 

694 (i) SEQUENCE CHARACTERISTICS: 

695 (A) LENGTH: 17 amino acids 

696 (B) TYPE: amino acid 

697 (C) STRANDEDNESS: single 

698 (D) TOPOLOGY: linear 
699 

700 (ii) MOLECULE TYPE: peptide 

701 

702 (iii) HYPOTHETICAL: NO 

703 

704 

705 (vii) IMMEDIATE SOURCE: 

706 (B) CLONE: Hydolysis product V 6 
707 

708 

709 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

710 

711 Gly Lys Gly lie Asp lie Lys Ser Ser Leu Ser Gly Leu Thr Val Leu 

712 15 10 15 
713 

714 Lys 

715 

716 

717 (2) INFORMATION FOR SEQ ID NO: 22: 
718 

719 (i) SEQUENCE CHARACTERISTICS: 

720 (A) LENGTH: 1236 base pairs 

721 (B) TYPE: nucleic acid 

722 (C) STRANDEDNESS: single 

723 (D) TOPOLOGY: linear 
724 

725 (ii) MOLECl]LE TYPE: DNA (genomic) 

726 

727 (iii) HYPOTHETICAL: NO 

728 

729 

730 (vii) IMMEDIATE SOURCE: 

731 (B) CLONE: Fragment 3 
732 

733 

734 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

735 

736 GATCCGCGGA AGCATAAAGT GTAAAGCCT6 GGGTGCCTAA TGAGTGA6CT AACTTACATT 60 
737 

738 AATTGC6TTG CGCTCACTGC CC6CTTTCCA GTCGG6AAAC CT6TCOTOCC AGCTGCATTA 120 
739 

740 ATOAATCGGC CAACGCGCGG GGAGAGOCGO TTT6COTATT GGOCGCCAGG GTGGTTTTTC 180 
741 

742 TTTTCACCAO TOAGACGGOC AACAOCTGAT TGCCCTTCAC CGCCTGGCCC TGAGAGAGTT 240 
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743 

744 6CAOCAAGCG GICCACGGT6 GXTTOCCCCA CCACCCGAAA ATCCTGITTO AIG6TG6ITA 
745 

746 ACGGCGGGAT ATAACAZGAO CIGTCITCGG TATCGTCGTA TCCCACTACC GAGAIATCC6 
747 

748 CACCAACGCO CAGCCCOGAC TCGGIAATGO CGCGCATIGC OCCCAOCOCC ATCTGATCGT 
749 

750 TGGCAACCAG CATC6CAGTG GGAACOA!I6C CCTCATTCAO CATTTGCATG GXTTGITGAA 
751 

752 AACCGGACAT GGGACICCA6 TCGCCTTCCC OTTCCGCIAT COGCTGAAIT XGATTGCGAG 
753 

754 TOAGATATTT AT6CCAGGCA GCCAGACGCA GACGCOCCGA GACAGAACTT AAIGGGCCC6 
755 

756 CTAACAGCGC GATTTGCIOO TGACCCAAIG CGACCAGATG CTCCACGCCC AGICGCGTAC 
757 

758 CGTCTTCATG GGAGAAAATA AIACTGTTGA TGOGTGICTG GTCA6AGACA TCAAGAAATA 
759 

760 ACGCC6GAAC ATTAGIGGA6 GCAGCITCCA CAGCAAXG6C ATCCTG6ICA ICCAGCGGAT 
761 

762 A6TTAAIGAX CAGCCCACTG ACGCGIT6CG CGAGAAGATT GIGCACCGCC GCTTTACA6G 
763 

764 CTTCGAC6CC GCTTC6TTCT ACCATCGACA CCACCACGCT GGCACCCAGT TGAICGGC6C 
765 

766 GAGATTTAAT CGCCGCGACA ATTTGCOACG GCGCGTGCAO GGCCAGACIO GAGGTGGCAA 
767 

768 COCCAAICAO CAACGACTGT TTGCCCGCCA GIIGITGTGC CACGCOOTTG GGAATGIAAT 

769 XCAGCTCCGC CAICGCCGCT TCCACTTTTT CCCGCOTTTT CGCA6AAACG IGGCXGGCCT 
770 

771 GGTTCACCAC 6CGGGAAACG GTCTGAIAAC AGACACCGOC ATACTCXGCG ACAICGIATA 
772 

773 ACGTTACTGG TTTCACATIC ACCACCCTGA ATTGACTCIC TTCCGGGCGC IATCAI6CCA 
774 

775 TACCGCGAAA GGTTTTGCGC CATTCGAIG6 TGTCCG 
776 

777 (2) INFORMATION FOR SEQ ID NO: 23: 
778 

779 (i) SEQTJENCE CHARACTERISTICS: 

780 (A) LENGTH: 326 base pairs 

781 (B) TYPE: nucleic acid 

782 (C) STRANDEDNESS: single 

783 (D) TOPOLOGY: linear 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 
1080 

1140 

1200 

1236 



784 
785 
786 
787 
788 
789 
790 
791 
792 
793 
794 
795 



(vii) 



(iii) 



(xi) 



(ii) 



MOLECULE TYPE: DNA (genomic) 



SEQTJENCE DESCRIPTION: SEQ ID NO: 23: 



HYPOTHETICAL: NO 



IMMEDIATE SOURCE: 
(B) CLONE: Fragment 4 
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TC6A6CT6AC T6ACCT6ITO CTTATATTAC AIC6ATAGCT TAOC6TAXAA TGTOT6GAAT 60 

IOT6AGCOAT AACAAITTCA CACAGTTTAA CITTAAOAA6 OAOATAIACA TATGOCTACC 120 

00AICCC60A CTAGICTGCI CCTG6CTTTT GOCCTGCTCT OCCTOCCCTG OCTTCAAOAO 180 

GOCAOTOCCT ICCCAACCAT TCCCITATCI AOACTTTTTO ACAACOCTAT GCTCC6COCC 240 

CATCOTCTOC ACCA0CT06C CTTTOACACC TACCAOOAGT XTGAAOAAOC CIATAICCCA 300 
AAGOAACAOA AOTATZCATT CCTOCA 326 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Clal-Ndel fragment 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 
COATAOCOTA TAAT6T0TGG AATTGTOAGC G6ATAACAAT TTCACACAGT TTTTCGC6AA 60 
OAAGGAGATA TACA 74 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Synthetic hGH gene end fragment 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
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849 OAICITCAAO CAOACCTACA GCOACOCACT ACICAAOAAC TACOOOCTOC ICTACTGCIT 60 
850 

851 CAOOAAOOAC AT06ACAAOG TC6AGACAIT CCIOC6CATC GIOCA6IGCC GCTCIGTOGA 120 
852 

853 GGGCAGCTGT OGCTTCTA6T AAGOTACCCT GCCCTACGTA CCA 163 
854 

855 (2) INFORMATION FOR SEQ ID NO: 26: 
856 

857 (i) SEQUENCE CHARACTERISTICS: 

858 (A) LENGTH: 48 base pairs 

859 (B) TYPE: nucleic acid 

860 (C) STRANDEDNESS: single 

861 (D) TOPOLOGY: linear 
862 

863 (ii) MOLECULE TYPE: DNA (genomic) 

864 

865 (iii) HYPOTHETICAL: NO 

866 

867 

868 (vii) IMMEDIATE SOURCE: 

869 (B) CLONE: Ndel-AccII synthetic fragment 
870 

871 

872 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

873 

874 TATGTCTGCG GTAAAAGCAO CGCGCTACG6 CAAGGACAAT 6TTCGCGT 48 
875 

876 (2) INFORMATION FOR SEQ ID NO: 27: 
877 

878 (i) SEQUENCE CHARACTERISTICS: 

879 (A) LENGTH: 361 base pairs 

880 (B) TYPE: nucleic acid 

881 (C) STRANDEDNESS: single 

882 (D) TOPOLOGY: linear 
883 

884 (ii) MOLECULE TYPE: DNA (genomic) 

885 

886 (iii) HYPOTHETICAL: NO 

887 

888 

889 (vii) IMMEDIATE SOURCE: 

890 (B) CLONE: Plasmid pEMR469 fragment with AOH2 promoter 
891 

892 

893 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

894 

895 G6GACGCGTC TCCTCTGCCG GAACACCGGG CATCTCCAAC TTATAAGTTG GAGAAATAAG 60 
896 

897 AGAATTTCAG ATTGAGAGAA TGAAAAAAAA AAAAAAAAAA AAOGCAGAOG AGAGCATAGA 120 
898 

899 AATGGGGTTC ACTTTTTGGT AAAGCTATAG CATGCCTATC ACATATAAAT AGA6TGCCAG 180 
900 

901 TAGCGACTTT TTTCACACTC GAGATACTCT TACTACTGCT CTCTTGTTGT TTTTATCACT 240 
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902 

903 TCTTGTTTCT TCTTT06IAA ATA6AATATC AAOCTACAAA AAOCAIACAA TCAACTATCA 300 
904 

905 ACIATTAACT ATATCOAIAC CATATGGATC C6ICOACTCT AOAGOAIC6X COACTCTA6A 360 
906 

907 O 361 
908 

909 (2) INFORMATION FOR SEQ ID NO: 28: 
910 

911 (i) SEQUENCE CHARACTERISTICS: 

912 (A) LENGTH: 58 base pairs 

913 (B) TYPE: nucleic acid 

914 (C) STRANDEDNESS: single 

915 (D) TOPOLOGY: linear 
916 

917 (ii) MOLECinLE TYPE: ONA (genomic) 

918 

919 (iii) HYPOTHETICAL: NO 

920 

921 

922 (vii) IMMEDIATE SOtJRCE: 

923 (B) CLONE: Fragment C 
924 

925 

926 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

927 

928 COATATACAC AATOTCTOCT 6TTAAGGCTO CTAGATACGG TAAGOACAAC GTTAGA6T 58 
929 

930 (2) INFORMATION FOR SEQ ID NO: 29: 
931 

932 (i) SEQUENCE CHARACTERISTICS: 

933 (A) LENGTH: 1013 base pairs 

934 (B) TYPE: nucleic acid 

935 (C) STRANDEDNESS: single 

936 (D) TOPOLOGY: linear 
937 

938 (ii) MOLECULE TYPE: DNA (genomic) 

939 

940 (iii) HYPOTHETICAL: NO 

941 

942 

943 (vii) IMMEDIATE SOURCE: 

944 (B) CLONE: Fragment D 
945 

946 

947 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

948 

949 CTACAAGGTT CACAAOOACC ACAA0ACC06 TOTCCAGACG GTGTACGAOA TGACCGICX6 60 
950 

951 TGTGCTTCTO GAGGGT6AGA TTGAOACCTC TTACACCAAO GCCGACAACA GCGTCATT6T 120 
952 

953 CGCAACCGAC TCCATTAAOA ACACCATTTA CATCACCGCC AAOCAOAACC CCGTTACTCC 180 
954 
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955 ICCC6AOCTO TTCGGCTCCA TCCTGGOCAC ACACTTTCAT TOAOAA6XAC AACACATCCA 240 
956 

957 T6CCGCICAC OTCAACAITO TCIOCCACCO CI6GACCCGG AXGOACATTO ACGOCAAGCC 300 
958 

959 ACACCCTCAC TCCTTCAICC 6COACAOCGA GGAGAAOCGG AATGT6CAGO TGGACOTGGT 360 
960 

961 CGAGGGCAAG GOCATC6AIA TCAAOTC6TC TCTOICCOGC CTOACC6TOC T6AAGAGCAC 420 
962 

963 CAACTCGCAG ITCTGGGGCT TCCTGCGTGA CGAGIACACC ACACITAAGG AGACCTOGGA 480 
964 

965 CCGIAICCTG AGCACCGACG XCGAIOCCAC TIGGCA6I6G AAGAAITTCA GZGGACICCA 540 
966 

967 GGAGGICCGC TCOCACGIGC CTAAOXTCGA TOCIACCTGG GCCACTGCTC OCGAGGTCAC 600 
968 

969 ZCIGCCGACX TTTGCTGAAG AIAACAGTGC CAOCGZGCAG GCCACIAIGT ACAAGATGGC 660 
970 

971 AGAGCAAAIC CTGGCGCOCC AGCAGCT6AT CGAGACTGTC GAGIACICGZ IGCCTAACAA 720 
972 

973 GCACTAXTTC 6AAAICGACC T6AGCTGGCA CAAGGGCCTC CAAAACACCG GCAAGAACGC 780 
974 

975 CGAGGTCTTC GCTCCTCA6T CGGACCCCAA CGGTCT6AIC AAGTGTACCG TCGGCCGGTC 840 
976 

977 CTCTCXGAAG TCTAAAITGZ AAACCAACAT GATICICACG TTCCGGAGIT TCCAAGGCAA 900 
978 

979 ACIGTATATA GZCTGGGAIA GGGTATAGCA TTCATTCACT TGT TT T TT AC TTCCAAAAAA 960 
980 

981 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA6GGC CCG 1013 
982 

983 (2) INFORMATION FOR SEQ ID NO: 30: 
984 

985 (i) SEQUENCE CHARACTERISTICS: 

986 (A) LENGTH: 207 base pairs 

987 (B) TYPE: nucleic acid 

988 (C) STRANDEDNESS: single 

989 (D) TOPOLOGY: linear 
990 

991 (ii) MOLECULE TYPE: DNA (genomic) 

992 

993 (iii) HYPOTHETICAL: NO 

994 

995 

996 (vii) IMMEDIATE SOURCE: 

997 (B) CLONE: Synthetic GAL7 fragment 
998 

999 

1000 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

1001 

1002 C6COTCTATA CTTCOOAOCA CT6TTOAGCO AA60CTCATT AOATATATTT XCT6TCATTT 60 
1003 

1004 TCCTTAACCC AAAAATAAGG GAGA6GGTCC AAAAAGC6CT CGGACAACT6 TT6ACCGTGA 120 
1005 

1006 TCCGAAGGAC TGGCTATACA GI6TTCACAA AATAOCCAAO CIGAAAATAA T6TGTA0CCT 180 
1007 
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1008 
1009 
1010 
1011 
1012 
1013 
1014 
1015 
1016 
1017 
1018 
1019 
1020 
1021 
1022 
1023 
1024 
1025 
1026 
1027 
1028 
1029 
1030 
1031 
1032 
1033 
1034 
1035 
1036 
1037 
1038 
1039 
1040 
1041 
1042 
1043 
1044 
1045 
1046 
1047 
1048 
1049 
1050 
1051 
1052 
1053 
1054 
1055 
1056 
1057 
1058 
1059 
1060 



TTAOCXAIOT TCA6ITAOTT IGOCAIO 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOIX>6Y: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Modified Zbal-Mlul adapter 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CTAG6CTAOC GOGCCCOCAT OCA 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Plasmid pSEl "site binding to Hindlll' 
fragment 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
AGCTG6CTCG CATCTCTCCT TCACGCGCCC OCCOCCCTAC CTGAGGCCGC CATCCACGCC 
GGTGAOTCGC GTTCT6CCGC CTCCC6CCTG IGGTGCCTCC T6AACTGCGT CCGCC6TCTA 
GGTAOGCTCC AAGGGAGCCG GACAAAGGCC CGGTCTCGAC CTGAGCTCTA AACTTACCTA 
GACTCAGCCG GCTCTCCACG CTTTGCCTGA CCCTGCTTGC TCAACTCTAC OTCTTTGTTT 
CGTTTTCTGT TCTGCGCCGT TACAACTTCA A6GTATGCGC TGGOACCTGG CAGGCGGCAT 
CTGGOACCCC TAOGAAGGGC TT6GGGOTCC ICGTGCCCAA GGCAGGGAAC ATAGIGGTCC 



207 



23 



60 
120 
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240 
300 
360 
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CAOOAA6GGG AGCAOAOOCA TCAG06T0TC CACIITGTCI CCGCAGCICC T6A6CCIGCA 420 
GA 422 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Synthetic HindIII-"site binding to BamHI 
fragment 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
AOCTTGTCGA CTAATACGAC TCACTATAGG GCGGCCGCOO GCCCCTGCAG GAATTC6GAT 60 



CCCCCGGGTG ACT6ACT 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



77 



(vii) IMMEDIATE SOXIRCE: 

(B) CLONE: Synthetic Hindlll-AccI fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

AGCTT6CCGC CACTATGTCC 6CAGTAAAAG CAGCCC6CTA CGGCAAGGAC AATGTCC6CG 60 

T 61 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 920 base pairs 
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1114 (B) TYPE: nucleic acid 

1115 (C) STRANDEDNESS: single 

1116 (D) TOPOLOGY: linear 
1117 

1118 (ii) MOLECULE TYPE: DNA (genomic) 

1119 

1120 (iii) HYPOTHETICAL: NO 

1121 

1122 

1123 (vii) IMMEDIATE SOURCE: 

1124 (B) CLONE: Hindlll-SnaBI fragment 
1125 

1126 

1127 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

1128 

1129 AGCTT6CCGC CACTATGTCC GCAGTAAAAO CAGCCCGCTA CGGCAAGGAC AATGTCCGCG 60 
1130 

1131 TCTACAAGGT TCACAAGGAC GAGAAGACCG GT6TCCA6AC G6TGTACOA6 ATGACCGTCT 120 
1132 

1133 GTGTGCTTCT GGAGGGTGAG ATTGAGACCT CTTACACCAA GGCCGACAAC AGCGTCATTG 180 
1134 

1135 TCGCAACCGA CTCCATTAAG AACACCATTT ACATCACCGC CAAGCAGAAC CCC6TTACTC 240 
1136 

1137 CTCCCGAGCT GTTCGGCTCC ATCCTGG6CA CACACTTCAT TGAGAA6TAC AACCACATCC 300 
1138 

1139 ATGCCGGTCA CGTCAACATT GTCTGCCACC GCTGGACCCG GATGGACATT OACGGCAAGC 360 
1140 

1141 CACACCCTCA CTCCTTCATC C6CGACAGCG AGGAGAAGCG GAATGTOCAG GTGGACGTG6 420 
1142 

1143 TCGAGGOCAA GGGCATCGAT ATCAAGTCGT CTCTGTCCOO CCTGACCGTG CTOAAGAGCA 480 
1144 

1145 CCAACXCGCA GTTCTGOGGC TTCCTGC6TG ACOAGTACAC CACACTTAAG GAGACCTGGG 540 
1146 

1147 ACCOTATCCT GAGCACCGAC GTCGATOCCA CTTGGCAOTG GAAGAATTTC AGTGGACTCC 600 
1148 

1149 AGGAGGTCCG CTC6CACGTG CCTAAGTTCG ATGCTACCIG GGCCACTGCT CGC6AGGTCA 660 
1150 

1151 CTCTGAAGAC TTTTGCTGAA GATAACAGTG CCAGCGIGCA OGCCACTATG TACAAGATGG 720 
1152 

1153 CAGAGCAAAT CCTGGCGCGC CAGCAGCTGA TCGAGACTGT CGAGTACTCG TTGCCTAACA 780 
1154 

1155 AGCACTATTT C6AAATCGAC CTGAGCTGGC ACAAGGGCCT CCAAAACACC GGCAAGAACG 840 
1156 

1157 CCGAGGTCTT CGCTCCTCAO ICGOACCCCA ACGGTCTGAX CAAGTGTACC GTCGGCCGGT 900 
1158 

1159 CCTCTCTGAA GTCTAAATTG 920 
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1079 Response Exceeds Line Limitations 
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(A) APPLICATION NUMBER: US 07/659,408 
prokaryotes 
eukaryotes 
animal cells 
cells 
fragment 
fragment 
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